Picture for Bo Peng

Bo Peng

Towards Effective Long-Video Event Prediction via Multi-Level Event Semantics Mining

Add code
May 29, 2026
Viaarxiv icon

Respecting Modality Gap in Post-hoc Out-of-distribution Detection with Pre-trained Vision-Language Models

Add code
May 26, 2026
Viaarxiv icon

Debiased Negative Mining Improves Out-of-distribution Detection with Pre-trained Vision-Language Models

Add code
May 22, 2026
Viaarxiv icon

FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

Add code
May 14, 2026
Viaarxiv icon

Uncovering Latent Pathological Signatures in Pulmonary CT via Cross-Window Knowledge Distillation

Add code
May 12, 2026
Viaarxiv icon

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Add code
Apr 26, 2026
Viaarxiv icon

A transformable slender microrobot inspired by nematode parasites for interventional endovascular surgery

Add code
Apr 15, 2026
Viaarxiv icon

Evidence-Based Actor-Verifier Reasoning for Echocardiographic Agents

Add code
Apr 07, 2026
Viaarxiv icon

Seeking Necessary and Sufficient Information from Multimodal Medical Data

Add code
Feb 27, 2026
Viaarxiv icon

How Foundational Skills Influence VLM-based Embodied Agents:A Native Perspective

Add code
Feb 24, 2026
Viaarxiv icon